The Log-Log Term Frequency Distribution

نویسنده

  • Jason D. M. Rennie
چکیده

Though commonly used, the unigram is widely known as being a poor model of term frequency; it assumes that term occurrences are independent, whereas many words, especially topic-oriented ones, tend to occur in bursts. Herein, we propose a model of term frequency that treats words independently, but allows for much higher variance in frequency values than does the unigram. Although it has valuable properties, and may be useful as a teaching tool, we are not able to find any applications that make a compelling case for its use.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

نوع گونه‌های گیاهی علفی و چوبی در رابطه با موقعیت‌های مختلف فیزیوگرافی با استفاده از شاخص‌های عددی و غیرعددی در جنگل‌های کوهستانی زاگرس

Numerical and parametrical indices are two of the most important and most widely used indices for assessing biodiversity in different societies. Nevertheless, their efficiency and selection in various ecosystems has always been one of the challenging issues of ecology. In this study, the numerical indices of diversity, richness and eveness, as well as abundance–ranking distribution models inclu...

متن کامل

Low flow frequency analysis by L-moments method (Case study: Iranian Central Plateau River Basin)

Knowledge about low flow statistics is essential for effective water resource planning and management in ungauged orpoorly gauged catchment areas, especially in arid and semi-arid regions such as Iran. We employed a data set of 20 riverflow time-series from the Iranian Central Plateau River Basin, Iran to evaluate the low-flow series using several frequencyanalysis methods and compared the resu...

متن کامل

The Zografos–Balakrishnan-log-logistic Distribution

Tthe Zografos–Balakrishnan-log-logistic (ZBLL) distribution is a new distribution of three parameters that has been introduced by Ramos et el. [1], and They presented some properties of the new distribution such as its probability density function, The cumulative distribution function, The  moment generating function, its hazard (failure) rate function, quantiles and moments, Rényi and Shannon ...

متن کامل

Log-Normal and Mono-Sized Particles’ Packing into a Bounded Region

Many systems can be modeled with hard and various size spheres, therefore packing and geometrical structures of such sets are of great importance. In this paper, rigid spherical particles distributed in different sizes are randomly packed in confined spaces, using a parallel algorithm. Mersenne Twister algorithm was used to generate pseudorandom numbers for initial coordination of particles. Di...

متن کامل

A New Design of Log-Periodic Dipole Array (LPDA) Antenna

This paper presents a new approach for design of the log-periodic dipole array antenna (LPDA) based on using of different design parameters in the LPDA elements to control the antenna behavior. In the proposed procedure, the design parameters can control the value of forward gain over the operating frequency range, and also adjust the gain flatness. Furthermore, this design procedure can decrea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005